YEAST ARTIFICIAL CHROMOSOMES AND THEIR USE IN THE 
CONTROL OF GENE EXPRESSION 
Field of the Invention 

This invention relates to yeast artificial chromosomes 
and their manipulation and transfer into cells and animals, 
to exploit the control of gene expression, and also to the 
resulting cells and animals. 
Background of the Invention 

The ability to transform suitable hosts with foreign 
DNA, and thus to express gene products not normally 
produced by the host, is an important goal of 
biotechnological research. Microorganisms can be used to 
produce desired proteins, while higher animals having 
desirable characteristics have also been produced. For 
example, EP-A-0264166, WO-A-8800239, WO-A-8801648, 
WO-A-9005188 and WO-A-9211358 describe the use of lactating 
animals to express foreign proteins which are produced in 
the milk and can be isolated therefrom; this provides a 
very satisfactory, controlled source of pure protein. 

Techniques to transfer cloned DNA into mammalian cells 
and transgenic animals have greatly facilitated the study 
of gene regulation and expression. Gene transfection 
experiments have also highlighted the fact that the limited 
size of many cloned DNA molecules prevents the efficient 
use of the numerous distant regulatory sequences believed 
to control expression. 

More particularly, in order to investigate regulation 
of complex loci and chromosomal domains harbouring clusters 
of genes, it is essential to introduce very large pieces of 
DNA into cells and animals. The conventional approaches 
used in transgenic animal technology have limitations, 
making it difficult to introduce DNA fragments which are 
greater than 100 kb; see Bruggemann et al, Eur. J. Immunol. 
21:1323-1326 (1991). For example, while germ line-dependent 
genomic imprinting may affect large areas in 
which a number of genes may be similarly regulated, there 
is no efficient method to study such "imprinted" domains. 
A satisfactory technique for introducing large DNA 
fragments would allow progress, and also facilitate the 
analysis of other complex loci such as the T-complex 
(harbouring specific deletions) for which a number of genes 
have been mapped which are crucial for mammalian 
development. In order to obtain a better understanding of 
the regulation of eukaryotic genes, it would be desirable 
to express these genes in their authentic genomic context 
after cloning and re-introduction into cells. 
The expression of mammalian genes is controlled at 
various levels: the genomic context (influence of 
neighbouring genes), regulatory elements proximal to the 
exons (e.g. promoters), regulatory sequences downstream of 
the termination codon (polyadenylation site) and regulatory 
motifs further away from coding sequences (e.g. enhancers). 
For immunoglobulin genes, it has been shown that expression 
of transgenes is poor when enhancer motifs are missing, and 
that these can be several thousand base-pairs away (25 kb 
for the heavy chain 3'-enhancer) from the nearest exon. 
Mouse models have been established to address the 
question of the immunogenicity of chimaeric, foreign and 
authentic antibodies used for therapeutic purposes; see 
Bruggemann et al, J. Exp. Med. 170:2153 (1989). It became 
clear that only authentic proteins escape the surveillance 
of the immune system. 

The techniques currently used for making human 
antibodies involve either in vitro immunisation and 
immortalisation of human lymphocytes or genetic 
engineering. The selection of rare specific 
antibody-producing human lymphocytes outside the body is difficult 
and, once the lines are obtained, their yield and stability 
are poor; see Borrebaeck, Immunol. Today 9:355 (1988). 
Genetic engineering (also termed "humanisation" of rodent 
antibodies) firstly has to be done for each individual 
mouse or rat antibody of therapeutic use and secondly does 
not yield completely human antibodies; see Riechmann et al, 
Nature 332:323 (1988). "Humanising" existing rodent 
antibodies already approved for therapy is currently the 
most successful way to obtain less immunogenic reagents. 
However, it would be a considerable improvement to have a 
mouse strain available which makes authentic human 
antibodies after immunisation with human materials. 

A repertoire of immunoglobulins has been obtained from 
transgenic mice carrying inserted human antibody gene 
segments in germ line configuration; see WO-A-9004036 and 
Bruggemann et al, PNAS 86:6709 (1989). A human mini IgH 
locus has been constructed with variable region genes (Vs), 
diversity segments (Ds), joining segments (Js) and the μ 
constant region gene (Cμ). The human gene segments 
rearrange in the lymphoid tissue of these mice (VDJ-Cμ) and 
antibodies with human μ heavy chains can be obtained after 
immunisation. However, the level of human IgM as opposed 
to mouse IgM is low, and specific hybridomas with human μ 
chains are rare after immunisation. A further complication 
is that most of those cells that produce human heavy chain 
also secrete endogenous mouse Ig. This means that 
rearrangement of human μ does not stop endogenous 
rearrangement; in other words, allelic exclusion is not 
achieved. Furthermore, the actual repertoire size of the 
produced human antibodies is unknown but might be small as 
the IgH construct contains only a limited number of V and 
D segments. 

Srivastava et al, Gene 103:53-59 (1991), describe 
plasmids which permit the insertion of a neomycin-resistance 
gene into the human DNA insert or the vector arm of a YAC. 
In the latter case, the plasmid also contains a LYS2 gene 
for selection in a yeast host. The URA3 gene is then 
replaced by a new insert, thus inactivating URA. 

Botstein et al, Science 240:1439-1443 (1988), 
describe the practical advantages of the yeast organism for 
cloning and manipulating large DNA molecules. Green et al, 
Science 250:94-98 (1990), report the cloning of segments of 
up to one million base-pairs in yeast cells in artificial 
chromosomes (YACs), allowing the long-range mapping and 
analysis of complex genomes. Yeast vectors for cloning of 
large DNA molecules combine two features: plasmid sequences 
for their propagation in E. coli, and yeast-specific 
sequences to ensure the replication and maintenance of a 
linear molecule when grown in yeast. 

Nevertheless, the problem of producing large molecules 
(of which specific examples are immunoglobulins, Factor 
VIII and Factor IX) on a commercial scale remains. At the 
molecular level, it would be desirable to introduce large 
DNA molecules containing single genes of considerable size, 
in order to facilitate correct expression. For example, 
the human Ig heavy chain locus accommodating all Vs, Ds, Js 
and C regions is estimated to spread over 3000 kb of DNA; 
the Factor VIII gene is almost 200 kb in size with 23 
exons. At present, a high level of expression is rarely 
achieved by the introduction of cDNAs and engineered genomic 
"minigenes" into transgenic animals. 

Summary of the Invention 

Surprisingly, it has now been found that specific DNA 
of considerable length can be introduced into ES cells via 
a YAC, without introducing DNA from the yeast. The need to 
purify the YAC out of yeast is thus avoided, and 
transformation is essentially foolproof. This discovery is 
applicable to the introduction of such DNA into other cells 
also. 

The appropriate YAC, for use in the present invention, 
includes a foreign gene or gene locus (i.e. including one 
or more genes or gene segments) of at least 100 kb and also 
a marker gene which allows selection in cells that are not 
prokaryotic or yeast cells. This construct can be obtained 
by integration of the marker gene, e.g. Neo, into a YAC 
including the foreign gene. 

In a further aspect of the invention, ES cells or 
other cells are transformed by the marked YAC, and thus can 
be selected and inserted into, say, animals which 
consequently express human immunoglobulin or other genes. 
In particular, a transgenic lactating, ovine or bovine 
animal, mouse or other rodent, or any non-human animal, may 
contain a foreign gene or gene locus of considerable 
length, e.g. at least 100 kb, often at least 300 kb, but 
also 1 Mb or more, if required. The animal may express the 
product of another animal, e.g. a human product, but not 
its own corresponding product. 
Description of the Invention 

The available YACs contain a marker gene that allows 
for selection in prokaryotic cells, e.g. for 
ampicillin-resistance, but do not contain a marker gene that would 
allow for selection for integration in eukaryotic, ES, 
somatic or mammalian cells. This invention utilises the 
recombination proficiency of yeast to introduce an 
antibiotic resistance into the YAC by recombination. For 
example, a neomycin-resistance gene controlled by the 
thymidine kinase promoter (tk neo) will allow selection in 
mammalian cells. Other suitable markers are hygromycin and 
HPRT-resistance. Depending on the circumstances, it may be 
desirable to prepare YACs with different markers, for 
integration. 

The present invention is based in part on the 
realisation that yeast vectors with dominant selectable 
marker genes allow targeted integration into left 
(centromeric) and right (non-centromeric) YAC arms as well 
as alterations to human-derived insert DNA. In 
transformation experiments, integration proceeds 
exclusively by homologous recombination, although yeast 
prefers linear ends of homology for predefined insertions. 
Targeted regions can be rescued which expedite the cloning 
of internal human sequences and the identification of 5' 
and 3' YAC/insert borders. Integration of, say, the 
neomycin-resistance gene into various parts of the YAC 
allows the transfer and stable integration, and also 
rescue, of large DNA molecules into a variety of mammalian 
cells including embryonic stem cells. 

The present invention utilises YACs as cloning 
vehicles for the complete integration of DNA of a length 
that has previously been difficult to transfer. The 
invention also allows specific modification of the DNA. 

It may be necessary to enlarge an existing YAC 
containing heavy and light chain genes. A variety of human 
heavy and light chain-containing cosmids is available; they 
may be added on to the YAC by homologous, site-specific or 
other integration. Further, YACs containing different or 
overlapping parts of defined loci may be crossed, in order 
to obtain a contiguous DNA molecule in one YAC. This 
allows a large part of the human heavy and light chain gene 
clusters to be reconstituted, and transgenic animals which 
make authentic human antibodies to be obtained. 

Similarly, a Factor IX-containing YAC may be modified 
in order to direct expression, for example, in milk. This 
may involve exchanging the Factor IX promoter for a milk 
gene promoter, e.g. the murine whey acidic protein gene 
promoter, by homologous recombination in yeast. Factor 
VIII is another product that can be produced in the same 
way. 

It is an important feature of the invention that the 
marker gene is incorporated into the YAC in an active form, in 
order to allow selection. For this purpose, the YAC 
without that marker is subjected to integration with a 
plasmid containing the marker gene and a sequence outside 
the foreign gene, e.g. the ampicillin-resistance gene. 
This may lead to a YAC containing multiple copies of both 
markers, and duplication at least is preferred, but the 
important point is that, say, neomycin-resistance can be 
observed. 

The starting materials and techniques for use in the 
invention are generally known, or the materials can be 
prepared by known techniques. For example, several 
separately-derived embryonic stem (ES) cell lines are 
available: see Mansour et al, Nature 336:348 (1988); 
Schwartzberg et al, Science 246:799 (1989); Johnson et al, 
Science 245:1234 (1989); and Zijlstra et al, Nature 342:435 
(1989). Transformation and selection procedures have 
already been established. Immunoglobulin genes on YACs are 
available, as is a YAC containing the Factor IX gene; these 
clones have been obtained independently by screening 
various YAC libraries; see Little et al, PNAS USA 86:1598 
(1989). The sizes for the YACs vary and are between 200 
and 600 kb. Cosmids may increase size to, say, 1.8 Mb. 

The methods concerned with the introduction of large 
DNA molecules into cells are microinjection and 
co-precipitation with calcium phosphate (using any naked DNA 
such as YACs, genomic DNA or dissected chromosomes) or 
protoplast fusion, using yeast protoplasts: see Oi et al, 
PNAS USA 80:825-829 (1983); Graham and van der Eb, Virol. 
52:456-467 (1973); and Richa and Lo, Science 245:175-177 
(1989). The approach of directly injecting large DNA 
molecules into fertilised eggs or ES cells was difficult 
because of the nature of DNA (large molecules are sticky). 
It is preferred to introduce the various YACs by protoplast 
fusion; this, however, needs the introduction of a 
selective marker gene into the YAC (see below). Another 
approach is the transfection of high molecular weight DNA 
or chromosomal DNA, mixed with selective marker DNA, into 
ES cells by calcium phosphate co-precipitation. 
Identification of integrated genes of interest could then 
be done by probing with human Alu repeat sequences, 
confirming integration and size, as well as with specific 
probes. This random approach is similar to a library 
screening and depends on the transfection frequencies. 
Multiple copies of selective marker genes may thus be 
produced. 

It is a primary object of the invention to introduce 
large DNA molecules into the germ line of mice or other 
non-human animals, e.g. via ES cells. For the purposes of 
this invention, such large molecules should be introduced 
in germ line configuration. By introducing them into ES 
cells, the germ line locus, e.g. for antibodies or 
immunoglobulins, rearranges in the lymphoid tissue of the 
animal, and thus antibody production takes place. 



PCT/GB92/01651 

WO 93/05165 

Techniques outlined above make use of YACs that 
contain, for example, well-characterised parts of the human 
immunoglobulin light and heavy chain loci or the Factor IX 
gene. In addition, high molecular weight DNA carrying any 
(large) genes of interest may be transferred into a variety 
of cells, including ES cells which can and have been used 
to obtain transgenic mice. 

The purpose of introducing coding sequences and 
flanking regions on large DNA molecules is to preserve the 
original genomic context which facilitates the correct 
expression. These techniques also allow the introduction 
and study of large gene families. In combination with gene 
targeting, authentic foreign proteins can be obtained in 
large yield without interference of the homologous 
endogenous gene products. In that way, an animal may be 
obtained with an immune system (in respect of antibody 
production) indistinguishable from that of man. 

The endogenous mouse antibodies may interfere with the 
transgenic human immunoglobulins. It may therefore be 
necessary to silence the mouse immunoglobulin heavy and 
light chain loci by gene targeting in ES cells. A mouse 
strain can then be obtained with a transgenic human 
immunoglobulin heavy and light chain gene cluster, that 
makes no antibodies of its own. 

In a particular example of the invention, a neomycin 
resistance cassette was integrated into a large (300 kb) 
YAC. The YAC contains a well-characterised region of the 
human immunoglobulin (Ig) kappa (κ) light chain locus. The 
modified IGK YAC was transferred into embryonic stem (ES) 
cell lines by spheroplast fusion. The approach is useful 
for the transfer and expression of large genes and gene 
families in their original genomic context into the 
germline of other species via ES cells. 

The κ locus in humans is estimated to spread over 2500 
thousand base-pairs of DNA. The genes of the 
immunoglobulin κ light chain are assembled during B-cell 
differentiation by somatic recombination: one of the many 
Vκ (V=variable) gene segments rearranges with one of the 
five Jκ (J=joining) segments, and a Cκ (C=constant region) 
polypeptide is transcribed. In order to investigate the 
expression of such complex loci, it is essential to 
introduce very large fragments, containing many gene 
segments as well as necessary regulatory sequences, into 
cells and animals. 

ES cells are suitable for gene manipulation 
experiments such as the introduction of large DNA molecules 
on YACs. Furthermore, ES cells can be reintroduced into 
blastocysts, and chimaeric and germline mice which carry 
the introduced loci can be derived from them. 

The invention is further illustrated by the specific 
description that follows, and also by the Materials and 
Methods section of Davies et al, Nucleic Acids Research 
20(11):2693-2698 (1992). The contents of that article are 
incorporated herein by reference. 

The YAC containing the human κ locus does not contain 
a marker gene which allows selection in ES cells. 
Therefore, in accordance with one embodiment of the 
invention, the neomycin resistance gene (neo) was 
integrated into the YAC. The neo gene permits selection of 
stable clones when transferred into mammalian cells. An 
example of the yeast-selectable marker gene that is also 
usually used is LYS2, which allows growth in 
lysine-deficient yeast media. Integration of exogenous DNA in 
yeast proceeds in an homologous fashion, and introducing 
new sequences into YACs, termed "retrofitting" by Eliceiri 
et al, PNAS USA 88:2179-2183 (1991), is facilitated because 
of the plasmid homology region of the YAC arms. 

Targeted integration in yeast is performed either 
using replacement constructs or integration vectors; see 
Scherer et al, PNAS USA 76:4951-4955 (1979); Pavan et al, 
Mol. Cell. Biol. 10:4163-4169 (1990), and PNAS USA 
88:7788-7791 (1991); and Srivastava et al, supra. Replacement 
vectors are designed to disrupt a region of homology by the 
insertion of exogenous DNA. In the particular example, 
homologous integration into YACs was studied using 
integration vectors which recombine as a whole and 
duplicate a given target sequence without impairing its 
function. Integration vectors can be rescued from YACs and 
also from transfected cells, as the bacterial sequences 
allow subcloning in bacteria, for example, and this permits 
the isolation of flanking insert DNA. The universal 
principle of integration using a set of vectors, several 
targeting sites and different YACs is shown. The modified 
YACs have been stably introduced into embryonic stem cells: 
thus, the experiments show the feasibility of transferring 
complex gene loci from one species to another. 

An important aspect of the evidence is that YACs can 
be used to transfer such loci into other cells, without 
transferring potentially-undesirable yeast DNA, while 
retaining the locus in its essential configuration and 
size. 
CLAIMS 

1. A cell that is not prokaryotic or a yeast cell, 
transformed with a foreign gene or gene locus of at least 
100 kb and also a marker gene which allows selection in the 
or another such cell. 

2. A cell according to claim 1, which is an embryonic 
stem cell. 

3. A cell according to claim 1 or claim 2, wherein the 
marker gene is in addition to an ampicillin-resistance or 
other prokaryotic selective marker gene and/or 
uracil-resistance or other yeast-selective marker gene. 

4. A cell according to any preceding claim, comprising at 
least two copies of the or each marker gene. 

5. A cell according to any preceding claim, wherein the 
foreign gene or gene locus is in germ line configuration. 

6. A cell according to any preceding claim, essentially 
free of yeast DNA. 

7. A yeast artificial chromosome (YAC) including a 
foreign gene or gene locus of at least 100 kb and also a 
marker gene which allows selection in cells that are not 
prokaryotic or yeast cells. 

8. A YAC according to claim 7, wherein the marker is for 
resistance to neomycin, hygromycin or HPRT. 

9. A YAC according to claim 7 or claim 8, wherein the 
foreign gene or gene locus and/or the marker gene are as 
defined in any of claims 3 to 5. 

10. A transgenic non-human animal including a foreign gene 
or gene locus of at least 100 kb. 

11. A transgenic non-human animal capable of expressing 
the product of another animal but not its own corresponding 
product. 

12. An animal according to claim 11, wherein said another 
animal is human. 

13. An animal according to claim 10 and also claim 11 or 
claim 12. 

14. An animal according to any of claims 10 to 13, which 
is a lactating animal and the gene product is isolatable 
from its milk. 